A Peer-to-Peer Inverted Index Implementation for Word-based Content Search

Author

  • Nuno Lopes
Abstract

For the last few years Peer-to-Peer (P2P) systems have been used for cooperative file sharing among users. Napster was the first system that allowed the exchange of music files through the collaboration of its peers. P2P systems make effective use of the widespread computing and storage resources available over the Internet, even if only temporarily. The ability to search is essential for taking full advantage of such a huge amount of shared information. The first generation of P2P systems, like Napster [4] and Gnutella [3], uses a word-based search model: users search for files that match a given word or list of words. Napster uses a central index for searching, although file transfers are made only between peers. A centralized index creates a single point of failure, which can be used to shut down the system. Gnutella was the first system to use a completely decentralized networking model. Searching is done through neighborhood broadcasts, which create a considerable amount of network traffic. Therefore, this system does not scale to a very large number of users [6]. The second generation of P2P systems, like Chord [9] and Pastry [7], is based on the Distributed Hash Table (DHT) model. A DHT system forms an overlay network where each peer is given a unique identifier. The identifier has constant size and may be generated as the result of a hash function. This identifier is also used as the key of the (key, value) pair each peer will store. The distributed hash table is formed by all the pairs stored on the peers. The overlay network is capable of locating any key of the hash table, and therefore the peer storing it, from any other peer in a scalable way. The routing procedure required for locating
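The DHT model described in the abstract can be illustrated with a small, single-process sketch. It is not the paper's implementation: the class `ToyDHT`, the helper `make_id`, the 32-bit identifier size, and the peer and file names are all hypothetical. Keys are assigned to the first peer whose identifier is greater than or equal to the key, wrapping around the ring (the successor rule used by Chord), and the scalable multi-hop routing is replaced by a lookup in a sorted list that every peer is assumed to know. Each word becomes a key whose value is the set of files containing it, which gives a word-based inverted index.

```python
import hashlib
from bisect import bisect_left

ID_BITS = 32  # assumed identifier size, chosen only to keep the example readable


def make_id(text: str) -> int:
    """Hash a string to a constant-size integer identifier."""
    digest = hashlib.sha1(text.encode("utf-8")).digest()
    return int.from_bytes(digest, "big") % (2 ** ID_BITS)


class ToyDHT:
    """Hypothetical in-memory stand-in for a DHT overlay (no real networking)."""

    def __init__(self, peer_names):
        # Each peer id maps to that peer's local store: key -> set of file names.
        self.stores = {make_id(name): {} for name in peer_names}
        self.ring = sorted(self.stores)

    def responsible_peer(self, key: int) -> int:
        # Successor rule: first peer id >= key, wrapping around the ring.
        i = bisect_left(self.ring, key)
        return self.ring[i % len(self.ring)]

    def publish(self, word: str, filename: str) -> None:
        # Store the (key, value) pair on the peer responsible for the word's key.
        key = make_id(word.lower())
        store = self.stores[self.responsible_peer(key)]
        store.setdefault(key, set()).add(filename)

    def search(self, words) -> set:
        # Fetch the posting set of each word and intersect them.
        postings = []
        for word in words:
            key = make_id(word.lower())
            store = self.stores[self.responsible_peer(key)]
            postings.append(store.get(key, set()))
        return set.intersection(*postings) if postings else set()


dht = ToyDHT(["peer-a", "peer-b", "peer-c", "peer-d"])
dht.publish("blue", "blue_moon.mp3")
dht.publish("moon", "blue_moon.mp3")
dht.publish("moon", "moon_river.mp3")
matches = dht.search(["blue", "moon"])  # files indexed under both words
```

In this sketch a multi-word query contacts one peer per word and intersects the returned sets locally; a real deployment would also have to route each lookup through the overlay and handle peers joining and leaving.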

Similar resources

Planning, Implementation and Evaluation of Active Learning-Teaching Methods in Inverted Classroom, Peer Learning and Virtual Education among Students in the School of Allied Medical Sciences

Introduction: Education in the field of medical sciences requires modern learning-teaching methods that enable students to take an active part in the process and that cultivate creative and thoughtful students. The aim of this study was to design, implement and evaluate active learning-teaching methods in the form of flipped classroom, peer learning and virtual education among paramedical students. ...

Efficient Peer-to-Peer Keyword Searching

Today, exponential growth in network content makes it difficult to build and maintain a complete document index to support efficient search. Centralized search services must actively and repeatedly probe the network for new or changed content. The scope and rapid evolution of the Internet means that even the best pull-based search services will always be incomplete and inaccurate. Recently, how...

Proof: A Novel DHT-Based Peer-to-Peer Search Engine

In this paper we focus on building a large-scale keyword search service over structured Peer-to-Peer (P2P) networks. Current state-of-the-art keyword search approaches for structured P2P systems are based on inverted list intersection. However, the biggest challenge in those approaches is that when the indices are distributed over peers, a simple query may cause a large amount of data to be tran...

A Novel Method for Content Consistency and Efficient Full-text Search for P2P Content Sharing Systems

A problem associated with current P2P (peer-to-peer) systems is that the consistency between copied contents is not guaranteed. Additionally, the limited full-text search capability of most popular P2P systems hinders the scalability of P2P-based content sharing systems. We proposed a new P2P content sharing system in which the consistency of contents in the network is maintained af...

Text-Based Content Search and Retrieval in Ad-hoc P2P Communities

We consider the problem of content search and retrieval in peer-to-peer (P2P) communities. P2P computing is a potentially powerful model for information sharing between ad hoc groups of users because of its low cost of entry and natural model for resource scaling with community size. As P2P communities grow in size, however, locating information distributed across the large number of peers beco...

Publication date: 2003